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Abstract 

This paper considers cooperative spectrum sensing algorithms for Cognitive Radios which focus on 
reducing the number of samples to make a reliable detection. We develop an energy efficient detector 
with low detection delay using decentralized sequential hypothesis testing. Our algorithm at the Cognitive 
Radios employs an asynchronous transmission scheme which takes into account the noise at the fusion 
center. We start with a distributed algorithm, DualSPRT, in which Cognitive Radios sequentially collect 
the observations, make local decisions and send them to the fusion center. The fusion center sequentially 
processes these received local decisions corrupted by Gaussian noise to arrive at a final decision. 
Asymptotically, this algorithm is shown to achieve the performance of the optimal centralized test, 
which does not consider fusion center noise. We also theoretically analyse its probability of error and 
average detection delay. Even though DualSPRT performs asymptotically well, a modification at the 
fusion node provides more control over the design of the algorithm parameters which then performs 
better at the usual operating probabilities of error in Cognitive Radio systems. We also analyse the 
modified algorithm theoretically. Later we modify these algorithms to handle uncertainties in SNR and 
fading. 

Index Terms 

Cooperative spectrum sensing, distributed detection, sequential detection. 

I. Introduction 

Presently there is a scarcity of spectrum due to the proliferation of wireless services. Cognitive 
Radios (CRs) are proposed as a solution to this problem. They access the spectrum licensed 
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to existing communication services (Primary Users) opportunistically and dynamically without 
causing much interference to the Primary Users. This is made possible via spectrum sensing by 
the Cognitive Radios (Secondary Users), to gain knowledge about the spectrum usage by the 
primary devices. However due to the strict spectrum sensing requirements ([1], [25]) and the 
various inherent wireless channel impairments spectrum sensing has become one of the main 
challenges faced by the Cognitive Radios. 

Multipath fading, shadowing and hidden node problem cause serious problems in spectrum 
sensing. Cooperative (decentralized or distributed) spectrum sensing in which different cognitive 
radios interact with each other exploiting spatial diversity, ([1], [22]) is proposed as an answer 
to these problems. Also it improves the probability of false alarm and the probability of miss- 
detection. Cooperative spectrum sensing can be either centralized or distributed [25]. In the 
centralized algorithm a central unit gathers sensing data from the Cognitive Radios and identifies 
the spectrum usage ([16], [25]). On the other hand, in the distributed case each Secondary User 
collects observations, makes a local decision and sends to a fusion node to make the final decision. 
Centralized algorithms provide better performance but also have more communication overhead 
in transmitting all the data to the fusion node. In the distributed case, the information that is 
exchanged between the Secondary Users and the fusion node can be a soft decision (summary 
statistic) or a hard decision ([16]). Soft decisions can give better gains at the fusion center but 
also consume higher bandwidth at the control channels (used for sharing information among 
Secondary Users). However hard decisions provide as good a performance as soft decisions 
when the number of cooperative users increases ([5]). 

Spectrum sensing problem can be formulated in different ways, two of them being Neyman- 
Pearson framework (fixed sample size detection, e.g. matched filter, energy detector etc.) and 
sequential detection framework, reducing the number of samples taken for deciding if a primary 
is transmitting or not. It is reasonable to consider the sequential framework for spectrum sensing 
as it enables the detector to decide upon the decision more quickly than the fixed sample size 
counterpart. More precisely there are two types of sequential detection: one can consider detecting 
when a primary turns ON (or OFF) (change detection, see [2], [11] and the references therein) 
or just testing the hypothesis whether the primary is ON or OFF ([6], [20], [24] and references 
therein). In sequential hypothesis testing one considers the case where the status of the primary 
channel is known to change very slowly, e.g., detecting occupancy of a TV transmission. Usage 
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of idle TV bands by the Cognitive network is being targeted as the first application for cognitive 
radio. This framework also finds application in quickest detection of unoccupied spectrum in 
slotted Primary User transmission systems (e.g., cellular systems). In this setup (minimising 
the expected sensing time with constraints on probability of errors) Walds' SPRT (Sequential 
Probability Ratio Test) provides the optimal performance for a single Cognitive Radio ([21]). 
But the optimal solutions for cooperative setup are not available. 

We consider sequential hypothesis testing in cooperative setup. Feedback from the fusion 
node to the CRs can possibly improve the performance. However that also requires an extra 
signalling channel which may not be available and also has its own cost. Thus in our framework 
we assume that there is no feedback from the fusion center to CRs. In sequential decentral- 
ized detection framework, optimization needs to be performed jointly over sensors and fusion 
center policies as well as over time. Unfortunately, this problem is intractable for most of the 
sensor configurations ([13], [23]). Specifically there is no optimal solution available for sensor 
configurations with no feedback from fusion center and limited local memory, which is more 
relevant in practical situations. Recently [7] and [13] proposed asymptotically optimal (order 1 
and order 2 respectively) decentralized sequential hypothesis tests for such systems with full 
local memory. But these models do not consider noise at the fusion center and assume a perfect 
communication channel between the CR nodes and the fusion center. Also, often asymptotically 
optimal tests do not perform well at finite number of observations. [24] takes into account noisy 
channels between local nodes and fusion center in decentralized sequential detection framework. 
But optimality of the tests are not discussed and the paper is more focussed in finding the best 
signalling schemes at the local nodes with the assumption of parallel channels between local 
nodes and the fusion center. Also fusion center tests are based on the assumption of perfect 
knowledge of local node probability of false alarm and probability of miss -detection. However 
our test is shown to be asymptotically optimal, we assume MAC channel at the fusion center 
and the test is not based on the local node probability of error. Furthermore uncertainty in the 
received Signal to Noise Ratio (SNR) at the CRs and fading channels between primary and 
CR requires a composite hypothesis testing extension to the decentralized sequential detection 
problem and is not considered in these references. 

In this paper, we propose DualSPRT. Unlike the previous works on cooperative spectrum 
sensing using sequential testing ([24], [20] and references therein) we analyse this algorithm the- 
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oretically also. Asymptotic properties of DualSPRT are studied and it is found that as thresholds 
increase, performance of the algorithm approaches the optimal centralized sequential solution, 
which does not consider fusion center noise. In addition, we generalize this algorithm to include 
channel gain uncertainty. Furthermore, we consider the receiver noise at the fusion node and use 
physical layer fusion to reduce the transmission time of the decisions by the local nodes to the 
fusion node. Later we improve over DualSPRT with more emphasis over the ease of design of 
parameters. Furthermore we introduce a new way of quantizing SPRT decisions of local nodes 
and extend this algorithm to cover SNR uncertainties and fading channels. We also study its 
performance theoretically. We have seen via simulations that our algorithm works better than 
the algorithm in [13] and almost as well as the algorithm in [7] even when the fusion center 
noise is not considered and the Multiple Access Channel (MAC) layer transmission delays are 
ignored in [7] and [13]. 

This paper is organised as follows. Section II presents the model. Section III provides Du- 
alSPRT algorithm. Theoretical performance of the algorithm is given. Section IV provides the 
asymptotic optimality of DualSPRT. In Section V we improve over DualSPRT. We compare 
the different versions so obtained and also compare them with existing asymptotically optimal 
decentralized sequential algorithms. Sections VI extends these algorithms to consider the effect 
of fading and SNR uncertainty. Section VII concludes the paper. 

II. System Model 

We consider a Cognitive Radio system with one primary transmitter and L Secondary Users. 
The L nodes sense the channel to detect the spectral holes. The decisions made by the Secondary 
Users are transmitted to a fusion node via a Multiple Access Channel (MAC) for it to make a 
final decision. 

Let Xkj be the observation made at Secondary User I at time k. The {X k j, k > 1} are 
independent and identically distributed (i.i.d.). It is assumed that the observations are independent 
across Cognitive Radios. Based on {X U; i, n < k} the Secondary User / transmits Y k j to the 
fusion node. It is assumed that the secondary nodes are synchronised so that the fusion node 
receives Y k = Ym=\ + where {Z k } is i.i.d. receiver noise. The fusion center uses {Y k } 
and makes a decision. The observations {X kl } depend on whether the primary is transmitting 
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(Hypothesis Hi) or not (Hypothesis H ) as 

Undergo: X k>l = N kh A; = 1,2,..., 

Under H x : X Kl = hiS k + N kh k = 1, 2, . . . , 

where hi is the channel gain of the I th user, S k is the primary signal and N k j is the observation 
noise at the I th user at time k. We assume {N k j, k > 1} are i.i.d. Let N be the time to decide 
on the hypothesis by the fusion node. We assume that iV is much less than the coherence time 
of the channel so that the slow fading assumption is valid. This means that hi is random but 
remains constant during the spectrum sensing duration. 

The general problem is to develop a distributed algorithm in the above setup which solves the 
problem: 

mmE DD = £[7V|#*] 
subject to Pi (Reject Hi) < a 1; andP (Reject H ) < a , (1) 

where Pi is the probability measure when Hi is the true hypothesis, i = {0, 1}, and < a , «i < 
1. We will separately consider E[N\Hi] and E[N\H ]. It is well known that for a single node 
case (L = 1) Wald's SPRT performs optimally in terms of reducing E[N\Hi] and E[N\H ] for 
given probability of errors. Motivated by the optimality of SPRT for a single node, we propose 
using DualSPRT in the next section and study its performance. 

Throughout this paper, Ei denotes expectation under hypothesis Hi. We use Pmd for Pi(reject 
Hi) and P F a for P (reject H ). In case of E DD , hypothesis under consideration can be understood 
from the context. 

III. Decentralized Sequential Tests: DualSPRT 
In this section we develop DualSPRT algorithm for decentralized sequential detection. 

A. DualSPRT algorithm 

To explain the setup and analysis we start with the simple case, where the channel gain, hi = 1 
for all Vs. We will consider fading in the next section. DualSPRT is as follows: 
1) Secondary node /, runs SPRT algorithm, 

W Kl = W fc _i,, + log [f hl {X Kl ) /f 0tl (X ktl )],&>!, W ,i — (2) 
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where fii is the density of X k j under Hi and f j is the density of X k j under H (w.r.t. a 
common distribution). 

2) Secondary node / transmits a constant b\ at time if W k ,i > 7i or transmits bo when 
Wfc,; < -7o> i-e., 

Y hJL = bx I{W k ,i > 71} + 6 l{W k ,i < -70} 

where 70, 71 > and I{A} denotes the indicator function of set A. Parameters bi, b , 71, 70 
are chosen appropriately. 

3) Physical layer fusion is used at the Fusion Centre, i.e., Y k = Yla=i ^k,i + Z k , where {Z k } 
is the i.i.d. noise at the fusion node. 

4) Finally, Fusion center runs SPRT: 

F k = F fc _! + log [# m (Y k ) /g^ (Y k )] , F = 0, ^, > 0, (3) 

where g_ fi0 is the density of Z k — /i and is the density of Z k + /i and [L\ being 
positive constants appropriately chosen. 

5) The fusion center decides about the hypothesis at time iV where 

N = M{k : F k > or F k < -/3 } 

and Po, (3\ > 0. The decision at time iV is Hi if Fn > fix, otherwise H . 
Performance of this algorithm depends on (71,70, /?o, b±, bo, (JLi, /io)- Any prior information 
available about H or Hi can be used to decide constants. Also we choose these parameters 
such that the probability of false alarm/miss-detection, Pf a /P m d at local nodes is higher than 
Pfa/Pmd- A good set of parameters for given SNR values can be obtained from known results 
of SPRT. 

Deciding at local nodes and transmitting them to the fusion node reduces the transmission 
rate and transmit energy used by the local nodes in communication with the fusion node. Also, 
physical layer fusion in Step 3 reduces transmission time, but requires synchronisation of different 
local nodes. If synchronisation is not possible, then some other MAC algorithm, e.g., TDMA 
can be used. 

Using sequential tests at SUs and at FC (without physical layer synchronization and fusion 
receiver noise) has been shown to perform well in ([6], [20]). In the next section we analyse the 
performance under our setup. 
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B. Performance Analysis 

We first provide the analysis for the mean detection delay Edd and then for Pfa- 



At node I, let 



\ Qg fi,i(X kjl ) 



fo,i{Xk,i 



log fi,i(X k j) 



We will assume S i: i finite throughout this paper. Sometimes we will also need pf l < oo. When the 
true hypothesis is Hi, by Jensen's Inequality, 5ij > 0. At secondary node I, SPRT {W k j, k > 0} 
is a random walk with expected drift given by Sij. 
Let 

N t = M{k : W ktl i (-7o, 7i)}, = inf{k : W Kl > 7 i} and JV° = inf{fc : W k j < - 7o }. 

Then JV, = min{^°, iV, 1 }. Also let = inf{fc : F k < -f3 } and A^ 1 = inf {k : F fc > /3 X }. Then 
stopping time of DualSPRT, iV = min(A^ 1 , N°). 

In order to have P F a = Paid, we choose 7i = 7o = 7, Pi — Po — (3, bi = —b = b and 
A*i = A*o = A*- Of course Pp^ and Pmd can be taken different by appropriately choosing 71, 70, 
/3i and fio and the analysis will carry over. 

1 ) E DD Analysis: At the fusion node F k crosses (3 under Hi when a sufficient number of local 
nodes transmit bi . The dominant event occurs when the number of local nodes transmitting are 
such that the mean drift of the random walk F k will just have turned positive. In the following 
we find the mean time to this event and then the time to cross (3 after this. The E DD analysis 
is same under hypothesis H and Hi. Hence we provide the analysis for Hi. 

The following lemmas provide justification for considering only the events {Nf} and {iV*} 
for analysis of E DD = E[N\Hi\. 

Lemma 1. For i — 0, 1, Pi(N t = Nf) — >■ 1 as 7 — > 00 and Pi{N = N l ) — > 1 as 7 — >• 00 and 

{3 ->• 00. 

Proof: From random walk results ([8, Chapter IV]) we know that if a random walk has 
negative drift then its maximum is finite with probability one. This implies that Pi{Nf < 00) — > 
as 7 —7- 00 for i ^ j but Pi(N'{ < 00) = 1 for any 7 < 00. Thus Pi(Ni = Nl) — > 1 as 7 — > 00. 
This also implies that as 7 — > 00, the drift of F k is positive for Hi and negative for H . Therefore, 
Pi(N = JV*) -»• 1 as 7 -»• 00 and f3 -+ 00. ■ 
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Lemma 2. Under H iy i = 0, 1 an J j i, 

(a) \Ni — Nf\ a.s. as 7 — > 00 and lim^oo ^ = Hindoo = D ^ ^ ^ and in L 1 . 

(b) \N — N l \ — > a.5. anJ lim ^ = lim ^ a.*. anJ in L 1 , as 7 — >■ 00 and /3 — >■ 00. 

Proof: Under P , 

N? I{N? <N{} <N t < A°, (4) 

and since Po^ < N[] — > 1 as 7 — > 00, | — ]Vj| — )■ a.s. as 7 — >■ 00. Also from Random 
Walk results ([8, p. 83]), Nf ft l/D(/ ,i||/i,i) a.s. and P[A°]/ 7 -> l/D{f^\\f^). Thus we 
also obtain Nift — > 1 /Z)(/ ,z I l/i,/) a - s - an d in L 1 . Similarly the corresponding results hold for 
N and A as 7 and (3 — > 00. (4) holds in the expected sense also. ■ 
Thus when 7 is large, we can approximate Ei[Ni\ by 7-D(/i,z||/o,z)- Also under Hi, by central 
limit theorem for the first passage time A/ (Theorem 5.1, Chapter III in [8]), 

A?~Aaf,^). (5) 

From Lemma 2, we can use this result for iVj also. Similarly we can obtain the results under 
H and at the fusion node. Let S 3 iFC be the mean drift of the fusion center SPRT Fk, under H, 
when j local nodes are transmitting. Let tj be the point at which the drift of F k changes from 
5{~p C to S J iFC and let Fj = E[F tj -i], the mean value of F k just before transition epoch tj. The 
following lemma holds. 

Lemma 3. Under H it i — 0, 1, as 7 — >• 00, 

PiiDecision at time t k is Hi and t k is the k th order statistics of N{, N%, . . . , N l L ) — > 1. 

Proof: From Lemma 1, 
Pj (Decision at time t k is Pj and t k is the A;" 1 order statistics of A{, A^, • • • , N l L ) 

> P(A/ < Nf,j ^ i, I = 1, . . . , L) -»• 1, as 7 00. ■ 
We use Lemma 1-3 and equation (5) in the following to obtain an approximation for Edd 
when 7 and (3 are large. Large 7 and j3 are needed for small probability of error. Then we can 
assume that the local nodes are making correct decisions. 
Let 

I* = min{j : 5{ FC > and < E[t j+1 \ - E\t 3 }}. 
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Fj can be iteratively calculated as 

Fj = + 6{ FC (E[t 3 ] - E[t^]), Fo = 0. (6) 

Note that 5{ FC (0 < j < L) is assumed to be j and tj is the j th order statistics of {Nf, < / < 
L}. The Gaussian approximation (5) can be used to calculate the expected value of the order 
statistics using the method given in [3]. This implies that E[tj]'s and hence Fjs are available 

offline. By using these values E DD E^N 1 )) can be approximated as 

- F* 

E DD « E[ti»] + j (7) 

°l,FC 

where the first term on R.H.S. is the mean time till the drift becomes positive at the fusion node 
while the second term indicates the mean time for Fk to cross from ti* onward. 

2) Pmd/Pfa Analysis: We provide analysis under H±. Pfa analysis is same as that of 
Pmd analysis with necessary changes. When the thresholds at local nodes are reasonably large, 
according to Lemma 3, with a large probability local nodes are making the right decisions and 
tk can be taken as the order statistics assuming that all local nodes make the right decisions. 
Then Pmd at the fusion node, when H 1 is the true hypothesis, is given by, 

Pmd = Pi (accept H ) = Pt{N < N t ). 

It can be easily shown that P%(Ni < oo) = 1 for any > 0. Also P\(N < oo) — > as 
— > oo. We should decide the different thresholds such that Pi(AT 1 < t\) is small for reasonable 
performance. Therefore 

P MD = Pi(N < N x ) > P^Nq < t u N, > t x ) w P 1 (iV < t x ). (8) 

Also, 

P(iVo < Nj) < P 1 (N < oo) = P 1 (N < h) + P x {t x <N < t 2 ) + P x {t 2 <N <t 3 ) + ... (9) 

One expects that the first term in the right hand side should be the dominant term. This is 
because, from Lemma 3, after t\, the drift of Fk will be most likely more positive than before t\ 
(if Pfa at local nodes are reasonably small) and cause fewer errors if the fusion center threshold 
is chosen appropriately. We have verified this from simulations also. Hence we focus on the first 
term. Combining this fact with (8), Pi(Nq < h) will be a good approximation for Pi (reject Pi). 

Let S k = log [gi (Y k ) /g (Y k )]. Then F k = Si + S 2 + ... + S k and if we assume that S k before 
ti, has mean zero and has distribution symmetric about zero (e.g., ~ A/"(0, a 2 )) then, 

Pi (reject Pi before t{) 
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(A) 



(B) 
> 



J^Pi [{Fk < -/?} n^i {F n > — /9}|* a > fcjpfc > fc] 
fc=i 

oo 

£ (p x [f, < -A ntl K > -/3}]Pi[n^l {P n > -/?}]) (l - 
fc=i 

oo 

£ (Px[P fe < -P\F k -i > -PW^l^Fn > -/?]) (l - 

k=l ~ n ~ 

^ ( / Pi[S k < -cjfF^i-P + c] dc) (l - 2P 1 [F fc _ 1 < -/?]) (l - $ tl (fc) 
fc=i "' c = 



where $ tl is the Cumulative Distribution Function of t\. Since we are considering only {F k , k < 
t\}, we remove the dependencies on t\. In the above equations (A) is because of the Markov 
property of the random walk and (B) is due to the following lemma. 

Lemma 4. If S\ has mean zero and distribution symmetric about zero, 



inf F n > -9 

Kn<k-1 



> 1 - IP 



F k -i < -9 



Proof: For random walk F k with mean zero and symmetric distribution, [4, p. 525], 



This implies 



Therefore, 



and hence 



sup F n > 

Kn<fe-1 



sup (-F n ) > 9 

Kn<fc-1 



P 



inf F n < 

L l<n<W 



< IP 



< IP 



< 2P 



inf F n > -9 

Kn<fc-1 



> 1 - 2P 



F k -i > 9 
(-iVi) > 6 
P fc -i < -0 

F k -! < -H 



(10) 



Similarly we can write an upper bound by replacing P[n^l\{F n > — 9}} with P[F k _i > —9]. 
We can make the lower bound tighter if we do the same analysis for the random walk between 
ti and t 2 with appropriate changes and add to the above bounds. 



11 



3) Example I: We apply the DualSPRT on the following example and compare the E DD and 
Pfa via analysis provided above with the simulation results. We assume that fo and fi are 
Gaussian with different means. This model is relevant when the noise and interference are log- 
normally distributed ([22]), and when X^i is the sum of energy of a large number of observations 
at the secondary nodes at a low SNR. 

Parameters used for simulation are as follows: There are 5 secondary nodes, L = 5, fo ~ 
A/"(0, 1) and fi ~ A/"(l, 1), where J\f(a, b) denotes Gaussian distribution with mean a and variance 
6. Also f = / 0j ; and f t = for 1 < I < L, and 6=1. The Pmd/Pfa and the corresponding 
Edd are provided in Table I. The parameters are chosen to provide good performance for the 
given Pmd/Pfa- The table also contains the results obtained via analysis. We see a good match 
in theory and simulations. The above example is for the case when X^i have the same distribution 



PiADSim. 


PmoAnal. 


EonSira. 


Edd Anal. 


0.00125 


0.0012 


15.6716 


16.4216 


0.01610 


0.0129 


13.928 


12.6913 



PFASim. 


Pfa Anal. 


EoDSim. 


Edd Anal. 


0.0613 


0.0497 


11.803 


10.583 


0.0031 


0.0027 


15.1766 


14.830 



(a) Under #1 



(b) Under H 



TABLE I: DualSPRT: Comparison of E DD and Pmd/Pfa obtained via analysis (lower bound 
on the dominating term) and simulation. 

for different I under the hypothesis H and H\. However in practice the X k j for different local 
nodes / will often be different because their receiver noise can have different variances and/or 
the path losses from the primary transmitter to the secondary nodes can be different. An example 
is provided here to illustrate the application of the above analysis to such a scenario. Now the 
order statistics t\* in (7) needs to be appropriately computed. 

4) Example II: There are five secondary nodes with primary to secondary channel gain being 
0, -1.5, -2.5, -4 and -6 dB respectively (corresponding post change means are 1, 0.84, 0.75, 0.63, 
0.5). f ~ A/"(0, 1), fo = fo,i for 1 < I < L. Table II provides the E DD and P F a via analysis 
and simulations. We see a good match. 



IV. Asymptotic optimality of DualSPRT 

The two hypotheses H and Hi are assumed to have known prior probabilities tx and 1 — n 
respectively. A cost c (> 0) is assigned to each time step taken for decision. Let Wi > 0, i — 0, 1 
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PFASim. 




EooSira. 


Edd Axial. 


18.78e-4 


19.85e - 4 


44.319 


43.290 


26.68e-4 


27.51e - 4 


36.028 


34.634 


36.30e - 4 


35.16c - 4 


27.770 


25.977 



TABLE II: DualSPRT for different SNR's between the primary and the Secondary Users: 
Comparison of E DD and P F a obtained via analysis and simulation. 

be the cost of falsely rejecting Hi. Then Bayes risk of a test S with stopping time iV is defined 
as 

K c (5) = tt[cE (N) + W P {reject H }} + (1 - ^[cE^N) + W^rejectHi}]. (11) 

Also, KL-divergence of two probability distributions P and Q on the same measurable space 
(O, J 7 ) is defined as 

' /loggdP , if P«Q, 
oo , otherwise , 

where P « Q denotes that P is absolutely continuous w.r.t. Q. We also use the following 
notation: 



D(P\\Q) 



(12) 



D° tot = J2 D (foi\\hi), D lot = J2 D ^\\f^ 



i , n 



D(fo,i\\h,i) D(hA\ki) 
-> Pi = 



u tot 



D l tot 



i=i i=i 
Let A(c) be the event that all the Secondary Users transmit bi when the true hypothesis is 

Hi. Also let A(A i ) be the drift of the fusion center LLR process F k when the A happens, i.e., 



A{A 



Ei 



(log 



)\A l \. We will also need 



77(c) = sup < n > 1 : log 



i=l 



h,i{Xii) I 1 / \ a 

7 7y v > —| logc| V , r(c) = max 71(c). 



(13) 



It can be seen that 77 (c) is the last time random walk with drift 5 i < 0, will be above — | logc|. 

We make the following assumptions for Theorem 1 and Theorem 2. 
(Al) {Xk t i, k > 0} is i.i.d. and independent of {X^ j, k > 0} for all / 7^ j. 
(A2) The following hold, for each I, 



log 



fi,i( x ) 



fij(x)dx < 00 and 



log 



f j(x)dx < 00. 



Jo,i(x)JJ ' J \ \fi,i(x) 

In the rest of this section, local node thresholds are 70 j = — rj| logc|, 71^ = p;|logc| and 
fusion center thresholds are f3 = — | logc|, /?i = | logc|. 
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We use 6i as the mean of the drift of fusion center random walk F k when all the local nodes 
transmit wrong decisions under Hi and a 2 as the variance of the drift, which is independent of 
local node decisions. For convenience in the following theorem, we take p,\ = /i = fx > and 

b 1 = -b = b> 0. 



Theorem 1. For DualSPRT with the assumptions A(l) — A(2), 



lim 



E [N) 



< 



+ M and lim 



< 



1 



D tot 



c ->o | logc| D)> ot c^o I logc| 

where M and Mi are of the form C /A(A°) and Ci/A(A 1 ) respectively, where C and C\ 
are constants. 

Proof: See Appendix A. ■ 

Remark 1. If we assume fusion center noise as Af(0,ap C ), then A(^4°) = —2fxLb/a 2 FC and 
A (A 1 ) = 2fxLb/ a 2 FC . Therefore M and Mi in Theorem 1 — > if fx — > oo and/or b — > oo and/or 
a\ c -> 0. 

Table III compares the asymptotic upper bounds of E DD from Theorem 1 with the analysis 
given in Section III-B and simulations. We use the example provided in Section III-B3 and 
assume 7r = 0.5. We see that the approximate analysis of Section III-B provides much better 
approximation at threshold values of practical interest in Cognitive Radio. Perhaps this is the 
reason, the asymptotically optimal schemes do not necessarily provide very good performance 
at operating points of practical interest. 



PFASimn. 


EnDSimn. 


EnoAnal. 


EoDAsym. 


0.01610 


13.928 


12.6913 


19.35 


0.00125 


15.6716 


16.4216 


21.17 



Pfa Simn. 


EouSiran. 


EoD-Anal. 


EoDAsym. 


0.0613 


11.803 


10.583 


18.26 


0.0031 


15.1766 


14.830 


20.81 



(a) Under Hi 



(b) Under H 



TABLE III: Comparison of Edd obtained via simulation, analysis and asymptotics. 



Let Ri = min 1<K i 



such that Ei 



exp 



loginf t > £ 

' foAXi.,) 



exp I —t log 



fo,l{Xi,i) 



and ip\ be the minimal value 



e r] for 7] > and (p t = Ya=i Pi- Let Cifc = lo S 



g-a {Y k ) ■ 



Then F n = J2k=i£k and fii(k) = Ei[£ h ]. We assume the fusion center noise is Af(Q,ap C ). The 
following lemma will be needed in the next theorem. 
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Lemma 5. If/j, < min (J^f 1 , ^^J, then E [e mF ^] < oo for < 771 < R* and R* > 1. 

Proof: See Appendix B. ■ 

Theorem 2. Assume A(l) — A{2). Let fi < min (yJ R ° a 8 F ^ , \ an d the fusion center noise 

be Gaussian. If 2ct > 1 + <p Q H — -, for some < ri < R , then lim — r = 0. Also 

2(a 2 + 9 ) a 2 c^oc|logc| 

■r ^ 2^2 . -, Ql j. n ,. -PmD „ 

y o n \ > 1 + V^i H o' i or ■ s ' ome < 77 < Ki, then lim — r = 0. 

2{a 2 + d\ ) o l c-^o c|iogc| 

Proof: See Appendix C. ■ 
Let lZ c (Scent.) and 1Z C (8 Duals prt) be the Bayes's Risk function of the optimal centralized SPRT 
without considering fusion center noise and of DualSPRT respectively. For optimal centralized 
SPRT without considering fusion center noise, ([13, p. 2076]), 

,. fcc(5cent.) _ ( 1 ~ 

™ c| logc| ~-{Dl t + DlJ- 
From Theorem 1 and Theorem 2, using (11), for DualSPRT with Gaussian fusion center noise, 

lim ^^prt) = ( jr 1-n g 



c"A"6 c |logc| VA° oi #L 

where C = M 7r + Mi(l — 7t). The constant C — >■ if we assume b{ and/or fj, as a linear function 
of I log c| . This shows that DualSPRT is asymptotically Bayes in the centralized setting, i.e., 

hm c _K) K c {8 cent) /Tl c {8 DualS PRT) = L 

Remark 2. By observing that [13, Theorem 1] remains applicable for DualSPRT, from [13, 
Theorem 2], DualSPRT is also asymptotically Bayes with respect to the Bayes solution in the 
system with full local memory and without considering fusion center noise. 

Remark 3. As the cost c decreases, essentially we are allowing more samples for detection, 
which is captured in the modified expressions of 7 0j /, 71,/, (3 and (3\. 

For Gaussian input observations at the local nodes, assuming fi 1 = f lt f ; = f for 1 < I < L, 

S'f , L5i 
- and <pi = - 

Using Lemma 1, 9 and 9\ can be taken as zero. Assume H : Af(0, 1) and a FC = I. For SNR 
values -4dB (H x : JV(0.63, 1)), -lOdB (H x : JV(0.3162, 1)) and -15dB (H x : A/"(0.1778, 1)), 
the conditions in the Theorem 2 are satisfied with /x in Lemma 5. 



we get Su = Si and p u = p h R { = — and (p t = — =-(1 - 1/V2 

2pJ P 



15 



V. Decentralized Sequential Tests: SPRT-CSPRT 

This section considers some improvements over DualSPRT. The improved algorithms are 
theoretically analysed and their performance is compared with existing decentralized schemes. 

A. New Algorithms: SPRT-CSPRT and DualCSPRT 

In DualSPRT presented in Section III-A, observations {Yk} to the fusion center are not 
always identically distributed. Till the first transmission from secondary nodes, these observations 
come from i.i.d. noise distribution. But after the transmission from the first local node till the 
transmission from the second node, Y k s are distributed as i.i.d. noise distribution with a different 
mean. Thus the observations at the fusion center are no longer i.i.d.. Since the non- asymptotic 
optimality of SPRT is known for i.i.d. observations only ([21]), using SPRT at the fusion center 
is not optimal although we have shown it to be asymptotically optimal. 

We improve DualSPRT with the following modifications. Steps (l)-(3) (corresponding to the 
algorithm run at the local nodes) are same as in DualSPRT. The steps (4) and (5) are replaced 
by: 

4) Fusion center runs two algorithms: 

F l k = (Fl, + log [<7 W (n) Iqz (Y k )]) + , F l = 0, (14) 

F° k = (Fl, + log [g z (Y k ) /g_, (Y k )])~, F ° = 0, (15) 

where = max(0, x), (x)~ = min(0, x) and [i\ and /i are positive constants. g z is the 
pdf of i.i.d. noise {Z k } at the fusion center and indicates the pdf of fi + Z k . 

5) The fusion center decides about the hypothesis at time N where 

N = mf{k : Fl > fa or F° k < /3 } 

and Po < < fa. The decision at time is Hi if > fa, otherwise H . 
The following discussion provides motivation for this test. 

1) A sample path of the fusion center SPRT under the hypothesis Hi is provided in Figure 1. 
If the SPRT sum defined in (3) goes below zero it delays in crossing the positive threshold 
fa. Hence if we keep SPRT sum at zero whenever it goes below zero, it reduces Edd- This 
happens in CUSUM ([14]). Similarly one can use a CUSUM statistic under H also. These 
ideas are captured in (14) and (15). 
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I 

Fig. 1: Sample Path of F k under SPRT Sum and CSPRT Sum for 71 = 8, ft = 20, ^ = 2 and 
A*o = -2- 

2) The proposed test is also capable of reducing false alarms caused by noise Z k before first 
transmission (t{) from the local nodes. For F£ and F° to move away from zero, the drifts 
should be positive and negative respectively. Let = E[Y k ) at time k. Then expected drift 
of Fl is 

= D( 9 f ik \\g z )-D(gp k \\g lil ). (16) 

Hence before first transmission from the local nodes positive expected drift is not possible. 
After first transmission under H 1 (assuming the local nodes make correct decisions, the 
justification for which is provided in Section III), the drift becomes more positive. Similarly 
for Ft But in case of DualSPRT, SPRT at the fusion center has drift given by log g " l( %\ . 
This drift is difficult to keep zero only before t\ and thus creates more errors due to noise 

Zk- 

3) Even though the problem under consideration is hypothesis testing, this is essentially a 
change detection problem at the fusion center. The observations at the fusion center have 
the distribution of noise before t\ and after t\ the mean changes. But in our scenario, this 
is a composite sequential change detection problem with the observations that are not i.i.d. 
and we look for change in both directions, it is difficult to use existing algorithms available 
for sequential change detection. Nevertheless our test provides a guaranteed performance 
in this scenario. 

We consider one more improvement. When a local Cognitive Radio SPRT sum crosses its 
threshold, it transmits &i/&o- This node transmits till the fusion center SPRT crosses the threshold. 
If it is not a false alarm, then its SPRT sum keeps on increasing (decreasing). But if it is a 
false alarm, then the sum will eventually move towards the other threshold. Hence instead of 
transmitting bj b the Cognitive Radio can transmit a higher / lower value in an intelligent 
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fashion. This should improve the performance. Thus we modify step (3) in DualSPRT as, 

4 

Y Kl = Y^b]l{W Kl e [Ti+(i-l)A 1 , 7l +iA 1 )}+6?I{W fc ,, e [- T o-(i-l)A a ,-7o-iA )} (17) 

i=i 

where Ai and A are the parameters to be tuned at the Cognitive Radio. 4A X and 4A are taken 
as oo. The expected drift under H\ (Ho) is a good choice for Ai (A ). 

We call the algorithm with the above two modifications as SPRT-CSPRT (with 'C as an 
indication about the motivation from CUSUM). 

If we use CSPRT at both the secondary nodes and the fusion center with the proposed 
quantisation methodology (we call it DualCSPRT) it works better as we will show via simulations 
in Section V-B. In Section V-C we will theoretically analyse SPRT-CSPRT. As the performance 
of DualCSPRT (Figure 2a) is closer to that of SPRT-CSPRT, we analyse only SPRT-CSPRT. 

B. Performance Comparison 

Throughout the section we use 71 = 70 = 7, (5\ = (3 = /3 and Hx = pL — ji for the simplicity 
of simulations and analysis. 

We apply DualSPRT, SPRT-CSPRT and DualCSPRT on the following example and compare 
their E DD for various values of Pmd- We assume that the pre-change distribution f and the 
post change distribution fi are Gaussian with different means. 

For simulations we have used the following parameters. There are 5 nodes (L = 5) and 
fo,i ~ -A/"(0j 1)' f° r 1 < I < L. Primary to secondary channel gains are 0, —1.5, —2.5, —4 and 
—6dB respectively (the corresponding post change means of Gaussian distribution with variance 
1 are 1, 0.84, 0.75, 0.63 and 0.5). We assume Z k ~ VV(0, 5) and the drift of DualSPRT and 
SPRT-CSPRT at the fusion center is taken as 2fj,Y k , with /i being 1. We also take D — D\ — 0, 
{b\, bl, bl, b\} = {1,2, 3, 4}, {6°, 6°, b° 3 , b\} = {-1, -2, -3, -4} and &i=-&o=l (for DualSPRT). 
Parameters 7 and /3 are chosen from a range of values to achieve a particular Pfa- Figure 2a 
provides the E DD and Pmd via simulations. We see a significant improvement in E DD compared 
to DualSPRT. The difference increases as Pmd decreases. The performance under H is similar. 

Performance comparisons with the asymptotically optimal decentralized sequential algorithms 
which do not consider fusion center noise (DSPRT [7], Mei's SPRT [13]) are given in Figure 2b. 
Note that DualSPRT and SPRT-CSPRT include fusion center noise. Here we take / 0j , ~ J\f(0, 1), 
~ Af(l, 1) for 1 < I < L and Z k ~ Af(0, 1). We find that the performance of SPRT-CSPRT 
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is close to that of DSPRT and better than Mei's SPRT. Similar comparisons were obtained with 
other data sets. 
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(a) Comparison among DualSPRT, SPRT-CSPRT and DualC- (b) Comparison among DualSPRT, SPRT-CSPRT, Mei's 
SPRT for different SNR's between the Primary and the sec- SPRT and DSPRT under Hi. 
ondary users, under Hi. 

Fig. 2: Comparisons with SPRT-CSPRT 



C. Performance Analysis of SPRT-CSPRT 

E DD and Pmd/Pfa analysis is same under Hi and H . Hence we provide analysis under H 1 
only. 

1 ) Pm d Analysis: Between each change of drift (which occurs due to the change in number of 
Cognitive Radios transmitting to the fusion node and due to the change in the value transmitted 
according to the quantisation rule (17)) at the fusion center, under Hi, (14) has a positive drift 
and behaves approximately like a normal random walk. Under Hi (15) also has a positive drift, 
but due to the min in its expression it will stay around zero and as the event of crossing negative 
threshold is rare (15) becomes a reflected random walk between each drift change. Similarly 
under H , (14) and (15) become reflected random walk and normal random walk respectively. 
The false alarm occurs when the reflected random walk crosses its threshold. 

Under Hi, let 

t p = M{k > 1 : < -/?} and 7> = M{k > 1 : > /?}. (18) 
Following the same argument in Section III-B2 P F a analysis, we get, 

^(reject H x ) = P x { Tp < 7» w P x { Tp < t x ). 

oo oo 

Pi{r p < ti) = J2 P i( r P <k,k<h) = ^Pi(t> < k\k < ti)Pi(t x > k). (19) 

k=l k=l 
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In the following we compute P\{rp > xjr^ < ti) and P\{ti > k). It is shown in [17] that, 

lim Pi{rp > x\tb < t±} = exp(— Xpx), x > 0, (20) 

where is obtained by finding solution to an integral equation obtained via renewal arguments 
([18]). Let L(s) be the mean of with F ° = s and S k = \og[g z (Y k ) /g_ Mo (Y k )]. Note that 
{Sk, k < ti} are i.i.d. From the renewal arguments, by conditioning on Si = z, 

L(s) = P(S 1 > s)(L(0) + 1) + / (L(s + z) + l) dF Sl (z) dz + F Sl (-f3 - s), 

J-ft-S 

where Fs is the distribution of before the first transmission from the local nodes. This is a 
Fredholm integral equation of the second kind ([19]). Existence of a unique solution for it is 
shown in [2]. By solving these equations numerically, we get = 1/L(0). 

From the central limit theorem approximation given in Section III-B1 we can find the distri- 
bution of t\. Thus (19) provides, 

oo L 

Pi (False alarm before t x ) » - e~ x e k ) - $ Nl (k)), 

k=l 1=1 

where $iY ( is the Cumulative Distribution Function of Ni, obtained from the Gaussian approxi- 
mation. 

2) E DD Analysis: In this section we compute E DD theoretically. Recall that t ; also indicates 
the first time at which i local nodes are transmitting. Mean of U can be computed from the 
method explained in [3], for finding k th central moment of non i.i.d. i th order statistics. 

Between ij and t ;+1 the drift at the fusion center is not necessarily constant because there 
are four thresholds (each corresponds to different quantizations) at the secondary node. The 
transmitted value changes after crossing each threshold, bi — > b 2 . . . — > b A . Let t\, 1 < j < 3 be 
the time points at which a node changes the transmitting values from bj to bj+i between ^ and 
U+i. We assume that with a high probability the secondary node with the lowest first passage time 
mean will transmit first, the node with the second lowest mean will transmit second and so on. 
This is justified by the fact that the distribution of a first passage time of 7 > by a random walk 
with mean drift 5 > and variance cr 2 is A/"(|, ^p 2 )- Thus if 5 is large, the mean 7/5 is small 
and the variance a 2, y/5 3 is much smaller. In the following we will make computations under 
these approximations. The time difference between tf and tf^ transmission can be calculated 
if we take the second assumption (=Ai/<5i /). We know E[U] for every i from an argument 
given earlier. Suppose I th node transmits at tf 1 instant and if E[ti) + Ai/5± : i < E[t i+ i] then 
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E[t\] = E[U] + Ai/<5i,,. Similarly if E[t}] + AJd^ < E[t l+l ] then E[tf\ = E[t\] + A 1 /5 hl and 
so on. Let us represent the sequence t = {ti,t\,tf,tl,t2, • ••>£|} (entry only for existing ones by 
the above criteria) by T = {T h T 2 , T 3 , ...}. 

Let S^ FC be the mean drift at the fusion center between T k and T k+1 , under H^. Thus T k s 
are the transition epochs at which the fusion center drift changes from 5^ F q to Sf FC . Also let 
F k = E[Fr k -i] be the mean value of F k just before the transition epoch T k . With the assumption 
of the very low Pf a at the local nodes and from the knowledge of the sequence t we can easily 
calculate <^ FC for each T k . Similarly F k+l = F k + 5\ )FC {E[T k+l ] - E[T k }). Then, 

E DD « E[T t .] + (21) 

where 

r = mm{j : <5 J 1FC > and < E[T J+1 ] - 

"1,FC 

The above approximation of Edd is based on Central Limit Theorem and Law of Large 
Numbers and hence is valid for any distributions with finite second moments. 

Table IV provides the simulation and corresponding analysis values. We used the same set-up 
as in Section V-B (with Z k ~ W(0, 1)). 



Paid Sim. 


PuuAnal. 


EoDSim. 


EooAnal. 


0.01675 


0.01624 


26.8036 


24.9853 


0.0072 


0.0065 


30.0817 


29.1322 


0.00686 


0.00623 


36.1585 


35.5624 



TABLE IV: Comparison of Edd and Pmd obtained via analysis and simulation under Hi for 
SPRT-CSPRT. 

VI. Unknown Received SNRs and Fading 

This section considers the extensions of DualSPRT and SPRT-CSPRT to take care of the SNR 
uncertainty and the slow fading between the Primary User and a Cognitive Radio. Since the 
transmissions from CR to FC are in CR network, we assume reporting channel to FC as AWGN 
only. This assumption is commonly made ([1], [22]) 
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A. Different and unknown SNRs 

We consider the case where the received signal power from the PU to a CR node is fixed 
but not known to the local Cognitive Radio nodes. This can happen if the transmit power of 
the primary is not known and/or there is unknown shadowing. Now we limit ourselves to the 
energy detector where the observations X k j are average energy of M samples received by the 
I th Cognitive Radio node. Then for somewhat large M, the pre and post change distributions of 
Xk,i can be approximated by Gaussian distributions: / ,z ~ N(a 2 , 2 erf /M) and fx,i ~ M{Pi + 
a 2 , 2(P[ + ai 2 ) 2 /M), where Pi is the received power and af is the noise variance at the I th CR 
node. Under low SNR conditions (P t + af) 2 ~ erf and hence X k j are Gaussian distributed with 
mean change under H and Hi. Now taking X fc ; — af as the data for the detection algorithm 
at the I th node, since Pi is unknown we can formulate this problem as a sequential hypothesis 
testing problem with 

H :6 = 0;H 1 :6>6 ll (22) 

where 9 is Pi under Hi and 6\ is appropriately chosen. 
The problem 

H :6<6 ;Hi:6>6 u (23) 
subject to P e {reject H } < a, for 9 < 9 and P e {reject Hi} < /3, for 9 > 9 1 , 

for exponential family of distributions is well studied in ([12]). The following algorithm of Lai 
[12] is asymptotically Bayes optimal and hence we use it at the local nodes instead of SPRT. 
Let 9 <E A = [ai, ct 2 ]. Define 



W n i = max 



log ^7^T'2^ lo f 



fe {X k )' ^ fe 1 (X k ) 



N(g, c) = inf {n : W n , t > g{nc)} , (24) 



where g() is a time varying threshold. The function g satisfies g(t) ~ log(l/t) as t — > and 
is the boundary of an associated optimal stopping problem for the Wiener process ([12]). 9 n 
is the Maximum-Likelihood estimate of 9 bounded by a\ and a-i- For Gaussian f and fx, 
9 n = max{ai, min[S n /n, a 2 ]}. At time N(g,c) decide upon H or Hi according as #zvo,c) < 
or ^at (9iC) > 9*, where 0* is obtained by solving D(fa\\fg ) = D(f e *\\f ei ). 

For our case where H : 9 = 0, unlike in (23) where H : 9 < 0, E[N\H ] largely depends 
upon the value 9 X . As 9i increases, E[N\H ] decreases and E[N\Hi] increases. If Pi E [P,P] 
for all I then a good choice of 8i, is (P — P)/2. 
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1 ) GLR-SPRT: First we modify DualSPRT. In the distributed setup with the received power 
at the local nodes unknown, the local nodes will use the Lai's algorithm mentioned above while 
the fusion node runs the SPRT. All other details remain same. We call this algorithm GLR-SPRT. 

The performance of GLR-SPRT is compared with DualSPRT (where the received powers are 
assumed known at the local nodes) for the example in Section III-B4 in Table V. Interestingly 
£7[JV|iZi] for GLR-SPRT is actually lower than for DualSPRT, but E[N\H Q ] is higher. 



Edd 


Pfa = 0.1 


Pfa = 0.05 


Pfa = 0.01 


DualSPRT 


1.921 


3.074 


5.184 


GLRSPRT 


2.745 


3.852 


6.115 



Edd 


Paid = 0.1 


Pmd = 0.05 


Pmd = 0.01 


DualSPRT 


2.06 


3.177 


5.264 


GLRSPRT 


1.425 


2.522 


4.857 



(a) Under H 



(b) Under H ± 



TABLE V: Comparison of E DD between GLRSPRT and DualSPRT for different SNR's between 
the Primary and the Secondary Users. 

2) GLR-CSPRT: This is a modified version of SPRT-CSPRT Here, we modify GLR-SPRT 
to GLR-CSPRT with appropriate change in quantisation and using CSPRT at the fusion cen- 
ter instead of SPRT. The quantisation (17) is changed in the following way: if 9 N > 9*, 
let Z x = [g(k c),g(kc3A)), X 2 = [g(k c3 A), g(k c2 A)), X 3 = [g{k c2 A), g(k c A)) and 
2-4 — [g{k cA), oo). Yh,i = b x n if Wk,i G 1 n for some n. If 9n < 9* we will transmit from 
{b®, ft", &3, ^4} under the same conditions. Here, A is a tuning parameter and < 3A < 1. 

The performance comparison of GLR-SPRT and GLR-CSPRT for the example in Section V-B 
(with Zk ~ A/"(0, 1)) is given in Figure 3a and Figure 3b. Here A = 0.25. As the performance 
under Hi and H are different, we give the values under both. We can see that GLR-SPRT is 
always inferior to GLR-CSPRT. For Edd under H±, interestingly GLR-CSPRT has lesser values 
than that of SPRT-CSPRT for P FA > 0.02 (note that SPRT-CSPRT has complete knowledge of 
the SNRs), while under H it has higher values than SPRT-CSPRT. 



B. Channel with Fading 

In this section we consider the system where the channels from the primary transmitter to the 
secondary nodes have fading [hi 7^ 1). We assume slow fading, i.e., the channel coherence time 
is longer than the hypothesis testing time. 
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(a) Under Hi 



(b) Under H 



Fig. 3: Comparison among SPRT-CSPRT, GLR-SPRT and GLR-CSPRT for different SNR's 
between the Primary and the Secondary Users 

When the fading gain hi is known to the I th secondary node then this case can be considered 
as the different SNR case as in the example given in Section III-B4. Thus we consider the case 
where the channel gain hi is not known to the I th node. 

We consider the energy detector setup of Section VI-A. However, now Pi, the received signal 
power at the local node / is random. If the fading is Rayleigh distributed then Pi has exponential 
distribution. The hypothesis testing problem becomes 

Ho ■ fo,l ~ Af(0, cr 2 ); H x : f u ~ N{0, a 2 ) (25) 

where 9 is random with exponential distribution and a 2 is the variance of noise. We will assume 
that cr 2 is known at the nodes. 

We are not aware of this problem being handled via sequential hypothesis testing before. 
However we use Lai's algorithm in Section VI-A where we take Q\ to be the median of the 
distribution of 9, i.e., P{9 > 9i) = 1/2. This seems a good choice for 6\ as a compromise 
between E[N\H ] and E[N\Hi]. 

1) GLRSPRT: First we apply the technique on GLRSPRT. We use an example where a 2 = 
1,9 = exp(l), Var(Zfc) = 1, and L = 5. The performance of this algorithm is compared with 
that of DualSPRT (with perfect channel state information) in Table Via (under H ) and Table 
VIb (under Hi). We observe that under Hi, for high Pmd this algorithm works better than 
DualSPRT with channel state information, but as Pmd decreases DualSPRT becomes better 
and the difference increases. For H , GLRSPRT is always worse and the difference is almost 
constant. 

2) GLR-CSPRT: We use the example given above. Table VII provides comparison of Dual- 
SPRT, GLR-SPRT and GLR-CSPRT. Notice that the comment given for E DD for Figure 3a is 
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Edd 


Pfa = 0.1 


Pfa = 0.05 


Pfa = 0.01 


DualSPRT 


1.669 


2.497 


4.753 


GLRSPRT 


3.191 


4.418 


7.294 



Edd 


Pmd = 0.1 


Pmd = 0.08 


Pmd = 0.06 


DualSPRT 


1.74 


1.854 


2.417 


GLRSPRT 


1.62 


3.065 


5.42 



(a) Under H 



(b) Under #i 



TABLE VI: Comparison between GLRSPRT and DualSPRT with slow-fading between Primary 
and Secondary Users 



also valid here. 



Edd Pfa=0.1 P F a=0.07 P fa =0.04 E dd Pmd=0.1 P M c=0.07 P A fD=0.04 

DualSPRT 1.669 1.891 2.673 DualSPRT 1.74 1.948 2.728 

GLR-SPRT 3.191 3.849 4.823 GLR-SPRT 1.62 3.533 9.624 

GLR-CSPRT 2.615 3.192 4.237 GLR-CSPRT 0.94 1.004 4.225 



(a) Under H (b) Under #i 

TABLE VII: Comparison among DualSPRT, GLR-SPRT and GLR-CSPRT with slow fading 
between the Primary and the Secondary Users 



VII. Conclusions 

This paper presents fast algorithms for cooperative spectrum sensing satisfying reliability 
constraints. We have presented and analysed DualSPRT, a decentralized sequential hypothesis 
test. Simulation results corroborate the theoretical study of DualSPRT Asymptotic properties 
of DualSPRT are also explored and it is found to be asymptotically optimal with respect to 
the optimal centralized test. Improvement over DualSPRT using CUSUM statics for the fusion 
center test leads to another algorithm in which the selection of parameters is easy to choose apart 
from performance enhancement. We also provide theoretical analysis of the algorithm. Numerical 
experiments show that this algorithm performs as well as an asymptotic order-2 optimal algorithm 
without fusion center noise, proposed in literature. We further extend our algorithms to cover 
the case of unknown SNR and channel fading and obtain satisfactory performance compared to 
perfect channel state information case. 
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Appendix A 
Proof of Theorem 1 

We will prove the theorem under H . The proof under Hi will follow in the same way. 
Let u(a) be the stopping time when a random walk starting at zero with drift A (.4°) (-ve 
under our assumptions) crosses a. Then, 

N < N < r(c) + v{-\ logc| - F T(c) ). 

Xhcrcforc 

w„\m < Eo [t(c)} | E K-|logc|-F r(c) )] 



| log c| I log c| 

We consider the first term on the R.H.S. of (26). Under our assumptions and choice of local 
node thresholds as p;|logc| and — rj| log c|, from [13, Theorem 2] and the definition of the 
stopping time in [13, p. 2084], it can be seen that 

c ^o | log c| D u tot 

Also from [8, Remark 4.4, p. 85] as c — > 0, 77(c) — > oo a.s. and lim ,T = — a.s. 

c^o|logc| d 0> i 

Therefore 

T ^^max--^ia.s. (28) 
I log cj i <V* § 

Furthermore, from [10, proof of Theorem 1 (i) =>■ (ii) p. 871], it can be seen that {77(c) /| logc|} 

is uniformly integrable. Thus, 

" ~ E[r{c)\ ^ 1 



logc| £ 



The second term in R.H.S. of (26), 



Eq[u(-\ 


logc 


-F T(C) )] ^ E [u 


H 


lo 


gc|)] 




1 log c 




loj 


?c| 





We know, from ([8, Chapter III]), 

Eq[u(-\ logc| 



I log c| A(A )' 
Next consider E [u(—F T ^)}. We have, 

I log c| 

£^)[i/(-F T(c ))] 1 



+ UL , TC,Ji . (29) 
I log c \ 



-> ~^T^- (30) 



J E [v(-x)\F t{c) = x] (1P Ft(c) (x) 



log c\ I log c 

+ rr l - 7 / ^K-s)]^ r 





00 



(c) V X J 



I log c| 



26 



< E [u(-\\ogc\)] | 
| log c | 



EoH-x)] x 

\\ogc\ T(C)V ; 



X 



I log c| 



Since Eq[v(—x)\/x -> -1/A(„4°) as x — >■ oo, for any e > 0, 3 M such that 



EoW-x)] 



< e 



x A (.4°) 

Take c\ such that | logc| > M for c < c\. Then, for c < c\, 

i 



for x > M. 



E [u(-x)] x 



X 



log c 



dP Fr , Jx) < 



A(A°) 



(c) 



log c| 



xdP FT(c) (x) < 



e — 



A(A°) 



log C 



I log c] 



I log c| 



(31) 



So^rCc)]- (32) 



Since lim c ^ r (c)/| logc| = 1/S a.s. and {r(c)/| logc|} is uniformly integrable, when 
< oo, 1 < / < L, we get, ([8, Remark 7.2, p. 39]), 



log 



/i,i(*i,<) 



/o,z(^i,i) 



lim rw<I eo , 

c^O | log C | <J 



where 9 is upper-bounded by A(A 1 ). 
From (31), (32) and (33), 



1 \ #o 



c^o |logc| ~ A(A°) v , 7 „„ 

This, with (26), (27) and (29), implies that (since e can be taken arbitrarily small), 

, E o[N) 1 
lim ^-4 < -tttt- + M , 



c^o | log c | Z>£ 



/(.)/ 



where M n 



(33) 



- - t^-t— ( 2 + ^ ) . Due to Lemma 1, #o approaches zero as c — > zero, 
Similarly we can prove lim — — — \ < — -. h Mi, Mi > being a constant depended on 



^o|logc| D\ ot 
A{A x )Mh = max(l/5 u ),^i and D\ ot . 



Appendix B 
Proof of Lemma 5 



By Holder's inequality, 



E [e^co] = e 
< (e q 



E T ( C ) i- 



E 



3 »7iPEl=i(Cfc-Mo(fc)) 



1/p 



1/9 



(34) 
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where l/p + l/q = 1. Let p' = rjip and £' k = £ fc — fj,(k). Thus, is i.i.d. ~ Af(0,a FC ). 



Also, E [e^M] < oo if E 
independent of r(c), 



< oo and E 



e »?l?Z)fc=i w( fc ) 



< oo. Since Yl=i & is 



OO 

n=l 

oo 



P[r(c) = n] 

oo 



n 



n=l 



-En 



n=l 



P[r(c) = n] 



En 



r(c)- 



< OO, 



(35) 



if E [e p 't'i} < oo and E o [0(p) r(c) ] < oo, where <f>[p) = E {e p '&}. From [9, Theorem 1.3] 



E Q [e mTl{ - c) ] < oo, for < r] 2 < R and R l = - log inf t > #c 



-tlog 



. Combining this 



fact with r(c) < Ef=i r «(c) (see (13)) yields Eq^W] < E [e^=i otW] < 00? for < ^ < 



P = min; P[,. Therefore E o [0(p) r(c) ] < oo if < log0(p) < ij 



Also, if < i]iq9 < R , 



En 



< E [e^^o] < oo. 



(36) 



Since the fusion center noise is M(0,a 2 Fr ), log</>(p) = 2-^—r]lp 2 and ^ig^o = mq-^-Lb. 

a FC ' a FC 

From (35) and (36), / 

Ro&FC Ro&FC | A 

2/iV ' 2/iE6g I " 



771 < mm 

Taking p = q = 2, to make R* > 1, we need, 

< min 



P* 



(37) 



Ro&pQ Rq<j fc \ 
~~8 ' ALb I 



(38) 



Appendix C 
Proof of Theorem 2 

We prove the result for P FA . For P MD it can be proved in the same way. 
Probability of False Alarm can be written as, 

P FA = p (Reject H ) = P [FA after r(c)] + P [FA before r(c)]. (39) 

The first term in R.H.S., 

P [FA after r(c)] = P [FA after r(c); *4°(c)] + P [FA after r(c); (A°(c)) c }. (40) 
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Since events {FA after r(c)} and (A°(c)) c are mutually exclusive, the second term in the above 
expression is zero. Now consider P [FA after t(c);A°(c)]. For < r < 1, 

P [FA after t(c);A°(c)] 

< P Random walk with drift A(A°(c)) and initial value F T / C \ crosses | log c| 

< Pq Random walk with drift A(A°(c)) and F T {c) < r\ logc| crosses | logc| 
Random walk with drift A(A°(c)) and F T ( C ) > r| logc| crosses | logc| 

Random walk with drift A(A°(c)) and P r(c ) < r \ logc| crosses | logc| 
+ P [F T (c) >r|logc|] . (41) 
Considering the first term in the above expression, 



+ Po 
< V 



P [Random walk with drift A(A°( 


c)) and P T(c) < r 


logc 


crosses 


logc|] 


c 


logc 





P [Random walk with drift A(A°( 


c)) and P T ( c) = r 


logc 


crosses 


logc|] 


c 


log c 





< 



(A) e -(l-r)s'|logc| 
< 



c| logc| 



11 i^ ' 

c| logc| 



(42) 



iff (1 — r)s' > 1. Here (A) follows from [15, p. 78-79] 1 where s' is positive and it is the 



solution of E 



s' log 



gMl( y fc) 



°-^ Y ^\A°(c) 



We choose s' > 1 and < r < 1 to satisfy (1 — r)s' > 1. For s' > 1, we need Lb/ fx > 1 
under H and Hi which follows from our assumptions. 
Consider the second term in (41). From Lemma 5, 



P [P T(c) >r|logc|] < 



e ?7ir|logc| 



< OO. 



If rjir > 1, 



En\e mF ^} 



->• 0. 



(43) 
(44) 



From Lemma 5, E [e niFir ^'>] < oo is assured by choosing as in (38). Then we can choose 

1 1 

771 > 1, — < r < 1 . 

1]i s' 



'For a random walk W n = ^2™—i Xi, with stopping times T a = inf{n > 1 : W n < Tfc = inf{n > 1 : W n > b} 
and T ai i, = min(r a ,T6), a < < 6, let s' be the non-zero solution to M(s') = 1, where M denotes the M.G.F. of Xi. 
Then, s' < if E[X t ] > 0, and s' > if < and E[exp(s'W Ta . b )] = 1 ([15, p. 78-79]). Then it can be shown that 

P(W Ta ) < exp(-s'a) when E[X t ] > and P(W T „) < exp(-s'b) when E[Xt] < 0. 



29 



Now we consider the second term in (39). Let F% be the Gaussian random walk with mean 
drift 9 , the worst case mean of the drift of F^. Under H , Oq > and under Hi, 0$ < 0. In 
contrast to F£ is a random walk. Then, 



P [FA before r(c)] < J^P ( 

n=l 
oo 



n=l 

oo 



sup F£ > | logc| 

Kk<n 



sup F fc * > | log C | 

Kk<n 



r(c) = n 



P [ r (c) = n] 



P [r(c) = n] 



(A) 



re=l 
oo 



sup (F,* - fc0 o ) > I log c| - n6 Q 

Kk<n 



P\t(c) = n] 



< 2P o [F* -n6 >\ log c\ - n# ] P [r(c) = n] 



n=l 

oo 



< ^exp I — ) P [r(c) = n] 

n=l ^ 



(45) 



2na 2 

where (A) is due to the inequality (10) and (B) follows from the Chernoff bound of Q-function. 
Therefore, 

M 



P [FA before r(c)] 
c| logc| 



1 ^ /• 

c logc ' V 

1 1 n=l x 



logc | 



+ -pp — | V 

c|logc| ^— ' 



exp 



n=M+l 



2Ma 2 

-< 
2a 2 



cxp 



cxp 



2a 2 

6*o | logc 
a 2 



cxp 



6» 1 logc| 



a" 



P [r(c) > M + 1] . (46) 



Take M = [| logc|/2(a + fi)], for some a > 0, [a] denoting integer part of a. Then the first 
term on the R.H.S. of (46) 



< 



1 nr / — | logc| 2 o |logc|\ 
1 A/T /^-|l°g c l A I lo S c l q \ 



Mc a ' a2 
c| logc | 



if we take a such that a/cr 2 > 1. In the following we take a = a 2 . 
The second term in (46), 



c lo. 



to 1 n=A/+l v ' v 



1 / g |logc| \ Pq [e^] 

-c|logc| eXP V o 2 ) e^+i) 



OO ' 

n=M+l ^ 



2o 2 



(47) 
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-(M+1)9q 



1 4^)^ 



cllogcl ^\ a 2 J eiW+V , -o 

-fln 

< K { E [e^l e-l 10 ^', (49) 

c| log c| 

where 7/ = 77 + 0g/2<r 2 2(a + O ) and K = e' v ' M ^ /l - ei$. 
Recall that E [e 7 ' 1 "^] < 00 if < 77 < i?o (see proof of Lemma 5). It can be seen that 77 and 

771 are independent constants. Moreover it is known from [9, Theorem 1.5] that, as | logc| — > 00, 

r _ 1 1 fl M x i,i) -1 

£ [e ,)Tl(c) ] < ^'oe^ol Iogc l, where (p l is the minimal value such that E e v ° og 'o,i(*i,i> = e '? j 
and </?' is a function of ip l . Therefore, due to (13), E[e r,T ^] < y^e^ ' logc ' L , as c — > 0, where 
y9o = 52i=i V^o an ^ ^ s a function of <^o- Then (49) is upper-bounded by 



r^c-^c" ->0, (50) 

c| log c\ 



in 



if 77' > 1 + <p + 

Pfa 

Combining (42), (44) and (50), we get — r ->■ as c ->■ 0. 

c log c 

p 

Following similar steps with obvious modifications we can also get — r — > as c — > 0. ■ 

c|logc| 
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